f Applications 
in Plant Sciences 



Applications in Plant Sciences 2014 2(1): 1300058 

Primer Note 



In silico mining of microsatellites in coding 

SEQUENCES OF THE DATE PALM (ArECACEAE) GENOME, 
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• Premise of the study: To complement existing sets of primarily dinucleotide microsatellite loci from noncoding sequences of 
date palm, we developed primers for tri- and hexanucleotide microsatellite loci identified within genes. Due to their conserved 
genomic locations, the primers should be useful in other palm taxa, and their utility was tested in seven other Phoenix species 
and in Chamaerops, Livistona, and Hyphaene. 

• Methods and Results: Tandem repeat motifs of 3-6 bp were searched using a simple sequence repeat (SSR)-pipeline package 
in coding portions of the date palm draft genome sequence. Fifteen loci produced highly consistent amplification, intraspecific 
polymorphisms, and stepwise mutation patterns. 

• Conclusions: These microsatellite loci showed sufficient levels of variability and transferability to make them useful for popu- 
lation genetic, selection signature, and interspecific gene flow studies in Phoenix and other Coryphoideae genera. 

Key words: Arecaceae; Coryphoideae; microsatellite/SSR mining; Phoenix dactylifera; transferability. 



The date palm (Phoenix dactylifera L.) is a monocotyledon 
species belonging to the Arecaceae family, and is widely culti- 
vated in North Africa, the Sahel (from the Atlantic to the Red 
Sea), the Middle East, and eastward to the Indus Valley. The 
date palm is well adapted to cultivation in arid and semiarid 
areas, and it has been introduced in warm and dry regions 
worldwide. Mainly grown for its fruits, the date palm represents 
an important ecological and socioeconomic resource. 

Despite the increasing number of studies on date palm, there are 
still not enough molecular markers available for a number of ap- 
plications. Most published microsatellite or simple sequence re- 
peat (SSR) markers are dinucleotide loci from unknown noncoding 
regions of the genome, generally isolated from microsatellite- 
enriched DNA libraries (Billotte et al., 2004; Arabnezhad et al., 
2012). The increasing amount of available genome sequence 
data offers new prospects for microsatellite marker develop- 
ment through in silico mining, a promising approach for date palm 
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(Cherif et al., 2013), based on the recently published date palm 
genome sequence (Al-Dous et al., 2011) and expressed sequence 
tags (ESTs) (Zhao et al., 2012). Our aim was to develop new mark- 
ers from coding sequences to ensure clear stepwise mutation pat- 
terns usable for genetic diversity, dating, and selection signature 
analyses, and also to facilitate transferability to other species. 



METHODS AND RESULTS 

In silico microsatellite mining and primer design were performed on the 
date palm genome draft sequence version 2 (Al-Dous et al., 201 1), with the Perl 
script SSR_pipeline-v2.pl (Poncet et al., 2006), which incorporates three free 
software programs: Tandem Repeats Finder (Benson, 1999), Primer3 (Rozen 
and Skaletsky, 2000), and BLAST (Altschul et al., 1990). The multi-FASTA 
file of all 19,414 predicted genes (full and partial; PDK20.mRNA.fsa) and the 
multi-FASTA file with all scaffold sequences (PDK20.fsa) from version 2 of the 
date palm genome research program at Weill Cornell Medical College in Qatar 
were downloaded from http://qatar-weill.comell.edu/research/datepalmGenome/ 
download.html. The search identified 204 genes containing coding sequences 
with microsatellites, 150 of which were suitable for primer design, but only 103 
had nonduplicated primer anneafing sites. Among them, we retained loci hav- 
ing perfect trinucleotide motifs with six (excluding those without annotation) or 
more (with or without annotations) repeats, and hexanucleotide motifs with at 
least four repeats (with or without annotation). 

Of the 47 primer pairs finally retained, 33 generated expected PCR amplifica- 
tion patterns in a preliminary test with eight P. dactylifera individuals (Table 1). 
The 33 loci were further tested on 16 individuals representing P. dactylifera (7), 
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P. reclinata Jacq. (2), P. roebelenii O'Brien (2), P. rupicola T. Anderson (2), 
P. theophrasti Greuter (2), and the interspecific hybrid P. canariensis x P. sylves- 
tris (Table 2). Among these loci, 15 showed consistent ampHfication and prom- 
ising polymorphisms across the sample and were further investigated in a 
variable number of individuals (80-1000) of the aforementioned species, in- 
cluding population samplings of P. dactylifera and P. reclinata. The transfer- 
ability of 10 loci was also evaluated in Chamaerops humilis L., resulting in 
100% positive amplification, with eight polymorphic loci displaying two to 12 
alleles among seven to 51 individuals (Table 3). Moreover, the ampHfication of 
one Hyphaene thebaica Mart, individual and one Livistona carinensis (Chiov.) 
J. Dransf. & N.W. Uhl individual was tested for five loci, with both species 
giving positive amplification results in three loci (mPdIRD25, mPdIRD31, and 
mPdIRD33). 

DNA from these individuals was extracted from freeze-dried or silica-dried 
leaf tissue. Samples were reduced into a fine powder using either an IKA AlO 
analytical grinder (IKA-Werke, Staufen, Germany) or a QIAGEN TissueLyser 
and QIAGEN DNeasy Plant Mini, Maxi, or 96-well kits (QIAGEN, Courtaboeuf, 
France). PGR reactions were performed in a thermocycler (Biometra GmbH, 
Gottingen, Germany, or Eppendorf AG, Hamburg, Germany) in a total reaction 
mixture of 25 jiL, containing: 10 ng of total genomic DNA, Ix PGR buffer, 
2 mM MgCl2, 200 |iM dNTP, 0.5 U of Taq DNA polymerase, 0.4 pmol of the for- 
ward primer labeled with a 5' Ml 3 tail, 2 pmol of the reverse primer, and 2 pmol of 
the fluorochrome-marked Ml 3 tail, plus sterile water to reach the final volume. The 



fluorochromes used were either 6-FAM, HEX, or TAMRA. The PGR parameters 
were as follows: denaturation for 2 min at 94°C; followed by six cycles at 94°C for 
45 s, 60°C for 1 min, and 72°C for 1 min; then 30 cycles at 94°C for 45 s, 55°C for 
1 min, and 72°C for 1.5 min; then 10 cycles at 94°C for 45 min, 53°C for 1 min, 
72°C for 1.5 min; and a final elongation step at 72°C for 10 min. 

The PGR products were processed on an ABI 3130XL Genetic Analyzer 
(Applied Biosystems, Foster City, California, USA). Allele size scoring was 
performed with respect to a noncommercial ladder using GeneMapper version 
3.7 software (Applied Biosystems). 

Genetic analyses (number of alleles, observed and expected heterozygosi- 
ties, Wright's fixation index [Fi^] and its significance calculated using the per- 
mutation test) were conducted with GENETIX version 4.05 software (Belkhir 
et al, 2004). 

Each of the 15 loci tested were polymorphic in at least one Phoenix spe- 
cies (Tables 2 and 3). The loci mPdIRD25, mPdIRD30, mPdIRD31, mPdIRD33, 
and mPdIRD40 were particularly suitable in P. dactylifera with three to 
eight alleles, having a clear stepwise mutation pattern in accordance with 
the microsatellite motif (tri- or hexanucleotide), and showing little to mod- 
erate heterozygosity deficit. The loci mPdIRD13, mPdIRD25, mPdIRD31, 
and mPdIRD33 were useful in Chamaerops humilis with three to 12 al- 
leles, confirming good intergeneric transferability. In addition, mPdIRD25, 
mPdIRD31, and mPdIRD33 were amplified in Livistona carinensis and Hy- 
phaene thebaica. 



Table 2. Test of functionality of the 33 loci across the Phoenix genus.' 



Locus 


Pdac (7) 


Prec (2) 


Proe (2) 


Prup (2) 


Pthe (2) 


Phyb (1) 


All (16) 


SMb 


Locus comment 


mPdIRDOl 


M 


M 


M 


M 


M 


M 


M 




100% amplification, monomorphic 


mPdIRD03 


P 


M 


M 


M 


Failed 


Failed 


P 


3 


Partial amplification, intra- or interspecific polymorphism 


mPdIRD04 


M 


M 


M 


M 


M 


M 


P 


3 


100% amplification, interspecific polymorphism 


mPdIRD05 


M 


M 


M 


M 


M 


M 


P 


No 


100% amplification, interspecific polymorphism 


mPdIRD07 


M 


M 


M 


M 


P 


M 


P 


3 


Partial amplification, intra- or interspecific polymorphism 


mPdIRD08 


M 


Failed 


Failed 


Failed 


Failed 


Failed 


M 




Partial amplification, monomorphic 


mPdlRDlO 


P 


P 


M 


Failed 


M 


Failed 


P 


3 


Partial amplification, intra- or interspecific polymorphism 


mPdIRDll 


P 


P 


P 


M 


M 


M 


P 


3 


100% amplification, intra- or interspecific polymorphism 


mPdIRD13 


P 


P 


P 


M 


P 


M 


P 


3 


100% amplification, intra- or interspecific polymorphism 


mPdIRD14 


M 


Failed 


Failed 


Failed 


Failed 


Failed 


M 




Partial amplification, monomorphic 


mPdIRD15 


M 


M 


M 


M 


M 


P 


P 


No 


100% amplification, interspecific polymorphism 


mPdIRD16 


P 


M 


M 


M 


M 


M 


P 


3 


100% amplification, intra- or interspecific polymorphism 


mPdIRD17 


M 


M 


M 


M 


M 


P 


P 


3 


100% amplification, interspecific polymorphism 


mPdIRD20 


M 


P 


M 


M 


P 


M 


P 


3 


100% amplification, intra- or interspecific polymorphism 


mPdIRD22 


M 


M 


M 


P 


M 


M 


P 


3 


100% amplification, intra- or interspecific polymorphism 


mPdIRD24 


M 


M 


M 


M 


M 


M 


M 




100% amplification, monomorphic 


mPdIRD25 


P 


P 


M 


M 


M 


M 


P 


3 


100% amplification, intra- or interspecific polymorphism 


mPdIRD26 


P 


M 


M 


M 


M 


M 


P 


3 


Partial amplification, intra- or interspecific polymorphism 


mPdIRD28 


P 


M 


P 


M 


M 


M 


P 


No 


100% amplification, intra- or interspecific polymorphism 


mPdIRD29 


P 


P 


M 


Failed 


Failed 


Failed 


P 


3 


Partial amplification, intra- or interspecific polymorphism 


mPdIRD30 


P 


P 


Failed 


Failed 


M 


Failed 


P 


3 


Partial amplification, intra- or interspecific polymorphism 


mPdIRD31 


P 


P 


M 


M 


M 


M 


P 


3 


100% amplification, intra- or interspecific polymorphism 


mPdIRD32 


M 


P 


M 


M 


M 


P 


P 


3 


100% amplification, intra- or interspecific polymorphism 


mPdIRD33 


P 


P 


M 


M 


M 


M 


P 


3 


100% amplification, intra- or interspecific polymorphism 


mPdIRD35 


M 


M 


M 


M 


M 


M 


M 




100% amplification, monomorphic 


mPdIRD36 


M 


P 


M 


M 


M 


P 


P 


3 


100% amplification, intra- or interspecific polymorphism 


mPdIRD37 


P 


P 


P 


P 


M 


P 


P 


3 


Partial amplification, intra- or interspecific polymorphism 


mPdIRD40 


P 


P 


M 


M 


P 


P 


P 


3 


100% amplification, intra- or interspecific polymorphism 


mPdIRD42 


P 


Failed 


Failed 


Failed 


Failed 


Failed 


P 


No 


Partial ampHfication, intra- or interspecific polymorphism 


mPdIRD43 


P 


M 


M 


M 


M 


M 


P 


6 


100% amplification, intra- or interspecific polymorphism 


mPdIRD44 


P 


Failed 


Failed 


Failed 


Failed 


Failed 


P 


No 


Partial amplification, intra- or interspecific polymorphism 


mPdIRD45 


M 


Failed 


Failed 


Failed 


Failed 


Failed 


M 




Partial amplification, monomorphic 


mPdIRD46 


P 


P 


P 


P 


P 


P 


P 


6 


100% amplification, intra- and interspecific polymorphism 



Note: M = monomorphic; P = polymorphic; Pdac = Phoenix dactylifera; Prec = Phoenix reclinata; Proe = Phoenix roebelenii; Prup = Phoenix rupicola; 
Pthe = Phoenix theophrasti; Phyb = Phoenix canariensis x Phoenix sylvestris; SM = stepwise mutation pattern. 

^Species abbreviations are presented with the number of samples tested in parentheses. Herbarium voucher information: Pdac = dad: cultivated, Kew, 
United Kingdom, MWC 1395 (K); dac2: cultivated, Elche, Spain, cv. 'Zahidi', MWC 1800/Barrow 77 (K); dac3: cultivated, Kew, MWC 1891 (K); dac4: 
cultivated, Kew, MWC 1398/Kew 1987-3379 (K); dac5: cultivated, Kew, MWC 1164 (K); dac6: feral, Gran Canada, Pintaud 636 (G); dac7: cultivated 
Faisalabad, Pakistan, cv. 'Khadrawy', Pintaud 648 (G); Prec = reel: Djibouti, Pintaud 642 (G); rec2: Zimbabwe, MWC 1874AVilkin 724 (K); Proe = roel: 
cultivated, Thailand, MWC 1161/Barrow 26 (K); roe2: cultivated, United Kingdom, MWC 1400/Kew 1987-530; Prup = rupl: cultivated, United Kingdom 
(from India), Pintaud 586 (G); rup2: Samchi, Bhutan, MWC 1162/Grierson and Long 3414 (K); Pthe = thel: cultivated, Sanremo, Italy, Pintaud 646 (G); 
Phyb = cultivated, Sanremo, Italy, no. 91005. 

'^In cases where stepwise mutation occurs, the number of base pairs of the repeat unit is given. 
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Table 3. Polymorphism characterization for 15 loci in Phoenix and 10 loci in Chamaerops. 



Locus 


Phoenix all/Pdac/Prec 




Phoenix dcLctylifevcL 




Chamaerops humilis 


AT 


/I 


Ho 






AT 


A 

/i 


mPdlRDll 


18/9/2 


2/2/2 












mPdlRDn 


700/560/25 


10/2/4 








7 


4 


mPdlRDlo 


100/87/2 


3/2/1 








7 


2 


niPdIRD20 


100/87/2 


5/1/2 








7 


5 


niPdlRD22 


100/87/2 


5/1/1 












mPdIRD25 


300/108/60 


5/4/2 


0.29 


0.42 


0.31* 


51 


3 


mPdIRD28 


184/108/15 


9/4/3 


0.06 


0.44 


0.85* 






mPdIRD30 


83/28/15 


4/3/2 


0.11 


0.10 


-0.04 






mPdIRD31 


850/573/85 


12/4/4 


0.19 


0.20 


0.03 


51 


3 


mPdIRD32 


186/108/15 


6/1/4 








51 


2 


mPdIRD33 


1000/618/85 


12/4/8 


0.19 


0.23 


0.16* 


51 


12 


mPdIRD36 


186/108/15 


5/1/3 








51 


1 


mPdIRD40 


1000/645/85 


11/8/6 


0.47 


0.53 


0.11* 


51 


2 


mPdIRD43 


100/87/2 


2/2/1 








7 


1 


mPdIRD46 


80/32/5 


6/3/3 













Note: A = number of alleles; = fixation index for inbreeding within populations; = expected heterozygosity; = observed heterozygosity; N = 
number of individuals tested; Pdac = Phoenix dactylifera; Phoenix all = all individuals of seven Phoenix species; Prec = Phoenix reclinata. 
* Significant departure from Hardy-Weinberg equilibrium. 



CONCLUSIONS 

The loci described here are a useful addition to previously 
published microsatellite markers for palms. Their interspecific 
allelic differentiation makes them particularly suitable for hy- 
brid and gene flow analysis within Phoenix. The most polymor- 
phic loci can be added to other SSR loci to create marker sets 
for genetic diversity analysis in P. dactylifera and other species. 
Their transferability within the Coryphoideae subfamily will 
facilitate the study of species with limited molecular resources, 
such as Chamaerops humilis. 
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